555win cung cấp cho bạn một cách thuận tiện, an toàn và đáng tin cậy [stt về gà đá]
Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks. Enterprise-grade STT made refreshingly simple (seriously, see benchmarks). We provide quality comparable to …
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. - …
28 thg 12, 2023 · Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式 - jianchang512/stt
[Blog] [Paper] [Model card] [Colab example] Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can …
Coqui STT (🐸STT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. 🐸STT is battle tested in both production and research 🚀
A free self-hosted STT for VRChat. Contribute to yum-food/TaSTT development by creating an account on GitHub.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription. - KoljaB/RealtimeSTT
The project enables a voice-driven conversation: spoken language is converted into text, passed to an LLM, and the LLM's response is read aloud via text-to-speech. - Stephan-01/LLM …
一个基于FastAPI的语音服务系统,集成了语音合成 (TTS)和语音识别 (STT)功能。本项目使用CosyVoice2作为TTS引擎,FunASR作为STT引擎,提供高质量的语音服务API。
About A fully offline voice assistant that combines lmstudio and applio together. Uses two methods of TTS, STT and also has some extra features.
Bài viết được đề xuất: